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We study stochastic processes that generate non-growing complex networks without self-loops 
and multiple edges (simple graphs). The work concentrates on understanding and formulation of 
constraints which keep the rewiring stochastic processes within the class of simple graphs. To 
formulate these constraints a new concept of wedge distribution (paths of length 2) is introduced 
and its relation to degree-degree correlation is studied. The analysis shows that the constraints, 
together with edge selection rules, do not even allow to formulate a closed master equation in the 
general case. We also introduce a particular stochastic process which does not contain edge selection 
rules, but which, we believe, can provide some insight into the complexities of simple graphs. 
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I. INTRODUCTION 

Growth processes account for some phenomena observed in 
real networks and they have been studied in many different 
scenarios (for survey see 0,1,0). Recently, more complex 
growth phenomena with node addition and/or deletion are 
studied in pj| and [T3 |. However, there are also many ex- 
amples of networks, mainly in biology (see [li| and associated 
references), in which a complex - and not classically random 
as in [l5| - topology is emerging but no growth process occurs. 
Instead of growth, one can observe a self-organizing stochas- 
tic process which moves some network connections to different 
nodes. Connections in these networks can also be dropped or 
created but the number of edges stays statistically bounded 
within a given interval. Technological networks like the In- 
ternet, once saturated in terms of size, also continue to un- 
dergo stochastic changes in structure due to various types of 
reorganization. Such processes, generally referred to as "non- 
growing" or "equilibrium networks", constitute an important 
class of complex network dynamics and have been studied 
within the framework of network statistical mechanics in 0] 
and @]. 

Although the dynamic behavior of non-growing networks 
has been studied in some detail, one aspect remains elusive: 
to date, the general theory allows multiple edges between 
nodes and self-loops connecting nodes to itself. Yet in most 
real networks there are no instances in which self-loops or 
multiple edges would provide a meaningful model of existing 
phenomena. It would therefore be beneficial to create mod- 
els that stay within the class of simple graphs (no self-loops 
and multiple edges). For example in 3, a large collection of 
real networks and their parameters is summarized, however 
in none of them multiple edges or self-loops would consti- 
tute a reasonable modeling feature. Even in the cases where 
multiple edges do exist (as for example in Internet routing 
network with backup links between the routers) it would be 



more useful to introduce an edge capacity as a model than 
to introduce multiple edges. The experience from modeling 
biological metabolic networks also shows that biologists are 
more interested in being able to quantify the amounts of the 
metabolites through an edge capacity than in a possibility of 
having more edges between the nodes. Therefore, if more ac- 
curate models of real complex networks are needed we have to 
understand how to model the constraints which would keep 
the network without multiple edges and self-loops. 

So far the methods used to approach this open problem [fj 
have had three basic thrusts: 

(i) Under certain conditions, asymptotic behavior in size and 
in time produces networks that are virtually free of self-loops 
and multiple connections [|| (i.e. very large networks will not 
develop many self-loops and multiple edges). However, in the 
case of medium size networks which do not grow over time, 
or in which the stochastic process persists for extremely long 
time, this principle no longer applies in the same way and 
would not explain the structural complexity characteristic of 
this class of simple graphs. 

(ii) Specific constraints are applied to the processes and ini- 
tial conditions as in 0, [l6|] . 

(iii) Steady state asymptotic features of processes for simple 
graphs are studied as in [f|. This approach provides an in- 
sight into the asymptotic behavior but if the states outside of 
an equilibrium are involved (transient phenomena in time do- 
main) or the network is not sufficiently large, a more detailed 
description is needed. 

In this article we introduce another line of thinking, where 
we study how the constraints, which keep the network without 
multiple edges and self-loops can be analytically formulated. 
Our arguments fall into the following parts: 
(i) The work concentrates on a basic process (hereinafter Sim- 
ple Edge Selection Process or S-ESP) in which an edge is ran- 
domly chosen and rewired to a preferentially chosen vertex. 
The process is well established and was studied in the network 
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community (e.g. [J]). It seems plausible to take this process as 
a basic building element of a non-growing network theory as it 
is the simplest imaginable process that goes beyond classical 
random graphs in terms of the process complexity. However, 
the S-ESP process does not ensure that the graph remains 
within the class of simple graphs. To overcome this modeling 
problem a simple modification is introduced, that precludes 
formation of multiple edges and self-loops. The modified pro- 
cess is denoted with SG-ESP (Simple Graph Edge Selection 
Process). 

(ii) We study how to describe analytically the constraints 
which preserve the simple graph property for the SG-ESP 
process. To express these constraints more complex distribu- 
tions than the degree distribution P(k) are needed. Namely, 
we introduce a new quantity P(k, k' , k") called wedge distri- 
bution, which describes the distribution of wedges (paths of 
length 2, see also Figure [3}, where the end nodes have de- 
gree k, k" and the middle node has degree k' . To be able to 
formulate the master equation of the SG-ESP process it is 
also necessary to study in detail the relations between P{k), 
P(k, k') and P(k, k' , k"), where P(k, k') provides information 
how the edges with degree k on one side and the degree k' on 
the other side are distributed. The quantity P(k,k') is often 
named "degree-degree correlation", however as the analysis 
shows, to understand the SG-ESP process we would also need 
distributions of more complex objects than edges, and in that 
case a terminology using "degree-degree" terms would not be 
of advantage. Therefore, we stick to the terminology where 
the distributions are named according to the objects they de- 
scribe. 

(iii) Usage of the edge distribution P(k, k'), the wedge distri- 
bution P(k,k' ,k") and the relations between them allows to 
formulate the master equation for SG-ESP. A master equation 
is a phenomenological equation that describes the dynamics 
of a certain quantity in a stochastic process. In the case of 
complex networks, the basic quantity described with master 
equations is the degree distribution. However, for the SG- 
ESP process the equation is not closed i.e. contains quantities 
that would need more equations to be sufficiently determined. 
The source of the problems is the edge selection rule together 
with simple graph constraints. To gain an insight into this 
complexity we propose to study a certain class of approxi- 
mations to the SG-ESP process which replaces edge selection 
with vertex selection rules and the edge-rewiring rules with 
the deletion and creation of edges. As a concrete instance 
of this class we define a process called VADE (Vertex Based 
Addition and Deletion of Edges) and provide experimental 
evidence that it can approximate SG-ESP in a certain pa- 
rameter range. This approximation by no means provides a 
solution to the problem of simple graph constraints, but it 
shows one possible direction how to understand more about 
the structure of the SG-ESP process. It is also interesting 
that the degree distribution master equation is not enough to 
describe the VADE process, but instead an equation for the 
edge distribution fully captures its behavior. In that sense 
VADE lies between classical random graphs and the SG-ESP 
process. 

(iv) We introduce a systematic method to derive master equa- 
tions for complex processes, where many different configura- 
tion cases can make the derivation error prone or very difficult. 
The method allows us to derive the master equation for the 
VADE process in a very efficient way. However, even if the 
equation is solvable in principle, its nonlinear character and 
complexity would need new methods to provide an insight 



into its solutions. 

The paper is organized as follows: we start with an anal- 
ysis of a well known S-ESP process [j; then we study joint 
distributions of edges and wedges; this leads us to section Hi El 
where we develop a master equation for the SG-ESP process. 
As an attempt to provide a deeper understanding into the 
structure of SG-ESP we introduce an approximation process 
called VADE in section Mil The VADE process is simpler 
than SG-ESP as its master equation shows, and the simu- 
lation results suggest that for some parameter ranges it can 
approximate SG-ESP very well. 



II. NETWORK EVOLUTION WITH SIMPLE 
GRAPH CONSTRAINTS 

Suppose as a starting point a simple graph G(V, E) with 
edge set E, \E\ = L and vertex set V, \V\ = N. We de- 
note with N(k) the number of nodes having degree k, and 
then degree distribution denoted as P(k) can be expressed as 
P(k) = N(k)/N. The averaging of a quantity X is denoted 
with (X) or with X. Specifically, the average degree of the 
network is denoted with k, and it equals k = 2L/N. In the 
cases where we consider changes of the basic quantities like 
P(k) or N(k) over time, we add the parameter t as in N(k, t) 
or P(k,t). 

In the next section we study the S-ESP process and we in- 
troduce a systematic method how to derive a master equation 
for processes like S-ESP. This method is also used in the fol- 
lowing sections to derive master equations for more complex 
processes. 



A. Simple Edge Selection Process 

The Simple Edge Selection Process (S-ESP) is defined by 
the following steps that are supposed to be repeated in each 
discrete time unit t (see Fig. [TJ: 

Process 1 (S-ESP) 

1. An edge, called Ei, is selected uniformly at random. 

2. An end vertex, called Vi, of Ei is selected uniformly at 
random. The other end vertex will be called Vj . 

3. A vertex, called Vi, is selected with a probability pro- 
portional to f(k) i.e. with probability ^M- where k 
is the degree of Vi and (/) is the mean value of f, 
</>=£/( s )P( s ). 

s 

4- The edge Ei is rewired from Vi to Vi i.e. the edge Ei 
between Vi and Vj) is deleted and a new edge between 
Vi and Vj is created. 

The process keeps the number of nodes and edges in the 
network constant, and the function f(k) models preferen- 
tial attachment. Preferential attachment is a concept stud- 
ied in much detail (see for example Q) that aims at mod- 
eling how the agents (processes) in complex networks would 
choose/search an object on which they want to operate. For 
example in the complex network of web pages and links, if 
a web page is updated, the author is often providing links 
(edges) to other web pages. The links are going to pages 
(nodes) which are known to the author and these are the pages 
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FIG. 1: (color online) Illustration of the S-ESP process. The 
process selects a random edge and rewires it to a node which 
was preferentially selected. A detailed explanation of prefer- 
ential selection and the process definition is provided in Sec- 
tion HU The rewiring is unconditional, this means that there 
is no test of the connectivity state between Vi and Vj . During 
the rewiring operation the following three cases can occur: 
In the case A there is no edge between the node Vi and the 
node Vj , therefore the process does not create a multiple edge 
or a self loop and it will keep the simple graph property. In 
the case B there is already an edge between Vi and Vj, and 
the process adds another edge between the nodes. In the last 
case C, if Vi = Vj the process creates a self-loop. 



having already many links pointing to them. In this case, the 
choice where to link is modeled choosing the function f(k) as 
f(k) = k, i.e. the nodes in step [3] of the S-ESP process are 
selected with higher probability if they have higher degree. 

The S-ESP process captures some aspects, like preferential 
attachment, of processes in real networks, however, this pro- 
cess generates multiple connections and self-loops, which are 
rarely observed in real networks. Multiple connections are 
created, because the edge is rewired irrespective of already 
existing connections between Vj and Vi. On the other hand, 
self- loops emerge when vertex Vi is the same vertex as Vj. 

To derive the master equation describing the dynamics of 
the degree distribution for the process above we start with a 
derivation of the probability term expressing the probability 
that Vi is of degree k' and that Vi is of degree k'" . The 
master equation can be constructed parameterizing this term 
with various values of k' and k'" which express all possible 
changes in degree of vertices touched by the process step. 

Selecting uniformly at random an edge and then selecting 
uniformly at random an end vertex of this edge is equal to se- 
lecting a vertex linearly preferentially pj i.e. with preference 
function f(k) = k. Therefore, the probability that vertex Vi 
has degree k! is given by 

Prob[deg(V) = k'] = Xtf(fc') (1) 



Degree of Vi 


Degree of Vi 


Change in N(k) 


k + 1 


k- 1 


+2 


k 


k 


-2 


k + 1 


k'" {k-l,k} 


+ 1 


k' =£{k + l,k] 


fe-1 


+ 1 


k 


k'" {k-l,k} 


-1 


k' ^{k + l,k} 


k 


-1 



TABLE I: The table provides a list of all cases at time t 
that lead to a change in the quantity N(k,t) at time t + 1. 
The minimal configuration describing changes in N(k, t) for 
the S-ESP process is shown in Figure [T] The probability that 
the configuration changes is equal to -j^N(k') ^jv(/) N(k"') as 
described in the text. The table is used to derive the master 
equation for the S-ESP (and SG-ESP) process. The mas- 
ter equation is generated using every row in the table exactly 
once to parameterize the probability configuration term above 
which describes the probability that the quantity N(k,t) will 
change. If the degree parameter column contains a set of de- 
gree values (as for example in the third row, second column) a 
summation over this set must be used. The third column con- 
tains a multiplier expressing how much the quantity N(k, t) 
would change. 



Analogically, the probability that V; is of degree k'" is given 
by 'iv7/\ N(k"'). Since these two selections are independent 
of each other and since the order of selection is strictly deter- 
mined, the probability that Vi is of degree k! and that Vj is 
of degree k'" is 



Prob[deg(Vi) = k' and deg(Vi) = k'"] 



To compute the number N(k, t+1) of vertices having degree 
k at time t + 1 one has to consider the number of vertices 
N(k, t) having degree k already at time t minus the number 
of all vertices that changed their degree from k to any other 
value k! 7^ k, plus the number of all vertices that changed 
their degree from any value k' 7^ k to k. A list of all cases 
in which the quantity N(k, t) changes its value is provided in 
Table [I] For example, the first row in Table U means that if 
vertex Vi had degree k+1 and vertex Vi had degree k — 1 at 
time t, both vertices will have degree k at time t + 1. Thus, 
compared to the situation at time t there are two vertices 
more with degree k at time t + 1 and this is expressed in the 
third column of the table. 

Now every row of Table [T] is used to generate one term in 
the master equation as a parametrized version of the basic 
probability term ([2}. If the degree parameters in the first 
two columns specify a set of degree values k, then a summa- 
tion over this set must be used. After having calculated the 
probability of each case multiplied by its effect (the third row 
of Table U}, we can write down the master equation for the 
number of vertices with degree k at time t + 1: 
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FIG. 2: (color online) Illustration of the SG-ESP process. 
The process selects a random edge and rewires it to a node 
which was preferentially selected. A detailed explanation of 
preferential selection and the process definition is provided in 
Section [TT] In contrast to S-ESP, according to the SG-ESP 
process the rewiring occurs only if Vi 7^ Vj and if Vi is not 
directly connected by an edge to Vj . The process verifies first 
if there is an edge between Vi and Vj and if there is no one 
(case A) the rewiring takes place, otherwise (cases B and C) 
the process leaves the original edge in place. 



N(k,t + T) = N(k,t) 



+1 x 



E 



fc"',fc"v{fc-l,fe} 



(3) 



Writing down all cases and using the fact that 
J2 f( s ) N ( s ) = (f) N the master equation of the S-ESP pro- 

s 

cess [j is obtained: 



N(k,t + 1) = N{k,t) - 



f(k) 
N(f) 



N(k,t) 



+ ^ N{k - l ^-m N ^ t) + k -m N{k + 1 ^ (4) 

Note that in the above derivation the probability that one 
selects twice the same vertex, i.e. Vi — Vi, is not taken into 
account, since this probability is small. Nevertheless, the re- 
sulting master equation [4] is not fully correct. This issue is 
addressed in ral. 




FIG. 3: (color online) Wedges are paths of length 2. The 
graph in this example contains the following 5 wedges: 
(Vs.Vi.Va), (Vi,V 2 ,V 3 ), (V 2 ,V 3 ,Vi), (^,14,14), (V 2 ,V 3 ,V 4 ). 
The wedge distribution W (k, k' , k") is defined in section Hi CI 
In this example the wedge distribution is W(l,3,2) = 2, 
W(2,3,2) = 1, W(2,2,3) = 2. 



B. Simple Graph Edge Selection Process 

A natural extension of the S-ESP process, which would 
keep the process within the class of simple graphs is to intro- 
duce a connectivity test whether there is an edge between the 
target nodes, and if one is detected, the rewiring would not 
take place. The process called Simple Graph Edge Selection 
Process (SG-ESP) can be defined as follows (see also Fig. [5J: 

Process 2 (SG-ESP) The following steps are repeated on a 
graph G in each discrete time unit t. 

1. An edge, called Ei, is selected uniformly at random. 

2. An end vertex, called Vi, of Ei is selected uniformly at 
random. The other end vertex will be called Vj. 

3. A vertex, called Vi, is selected with a probability propor- 
tional to f(k) as in s£ep[3| of the S-ESP process. 

(a) Check ifVi is an end vertex of Ei or ifVi is directly 
connected to Vj. If so, skip the next step. 

4- The edge Ei is rewired from Vi to V;. 

The only difference to the S-ESP process is the step I3al 
However, to express analytically what this means, one has to 
know when the condition in step [3a] is (not) satisfied. This 
means to know the probability that a vertex Vi is (not) di- 
rectly connected to an edge Ei, in other words Vi is (not) a 
direct neighbor of Vj. 

One of the main goals of the present paper is to show that 
we need to study not only edge distributions but also distri- 
butions of higher order objects (wedges) to understand the 
SG-ESP process in the class of simple graphs. In the next 
sections we investigate the relations of these distributions and 
show that they are necessary to analytically express the con- 
dition in step [3a] of the SG-ESP process. 



C. Higher order distributions 

We use the term higher order distribution generally to 
denote a distribution of more complex objects than nodes. 
Apart from the degree distribution P(k), the first higher order 
distribution, which is necessary for further analysis, describes 
a distribution of edges and is defined as 



P(k,k') = 



i^, iffc/fc', 
iik = k'. 



(5) 
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FIG. 4: (color online) Two non-isomorphic graphs Gi and 
Gi with N — 5,L — 5, where the node degrees are 
shown. The graphs have the same degree distribution P(k) — 
(P(l) = §,P(2) = §,P(3) = |) but different edge distri- 
butions P Gl (*,*') = (P(l,2) = ^,P(1,3) = 0,P(2,2) = 
§,P(2,3) = and Pa 2 {k,k') = (P(I,2) = 0,P(I,3) = 
i,P(2,2) = |,P(2,3) = The example shows that in 

general the degree distribution P(k) can not uniquely deter- 
mine the edge distribution P(k,k'), but P(k,k') determines 
P(k) according to equation [5] 



where L(k, k') denotes the number of edges in the network 
whose end vertices have degree k and k', and L denotes the to- 
tal number of edges in the network. The distribution P(k, k') 
does not describe directly the distribution L ( k £ k i but there 
are two important reasons why the above form of the edge 
distribution is necessary: a) P(k, k') can be interpreted as a 
probability that a vertex with degree k has a neighbor with 
the degree k' , and this form can be directly used for the anal- 
ysis of the SG-ESP process; b) it is technically simpler to 
express the relation between P(k, k') and P(fc) as between 
) anc [ P{k). Moreover, if needed, L ( k £ k ) can be easily 
computed from P(k,k'). 

The edge distribution P(k,k') is symmetric, 
P(k,k')=P(k' ,k), because in the definition of L(k,k') 
we cannot distinguish which end vertex of the edge has 
degree k resp. k! . It was reported 0], that 

P{ ^ = \ E p ( fc > fc ') ( fi ) 

k' 

and that for uncorrelated networks the edge distri- 
bution factorizes, namely P(k, k') = Pc(ft)Pz,(fc') where 
P L (k) = ^M. 

The next component needed for analysis of the SG-ESP 
process is a distribution describing probabilities of higher or- 
der objects called wedges. A wedge is an object formed by 
three vertices which are connected together by two edges (a 
path of length 2, see Fig. [3}. Similarly as for edges, we define 
the wedge distribution as: 

f 1 W(k,k',k") ^ jffc^fc" 

P(fc, k', k") = | ^ (fc<fc %») x k = k »[ W 

where W(k, k! , k") is the number of wedges with the mid- 
dle node having degree k' and one of the end nodes with 
degree k and the other with the degree k" . The number of all 
wedges in the network is denoted with W . The wedge distri- 
bution is symmetric in both outer arguments, P(k,k' ,k") — 
P(k" ,k' ,k), but in general P(k, k , k") ^ P(k' ,k,k") and 
P(k,k',k") P(k,k",k'). 

A vertex of degree k is a middle vertex of ( 2 ) = K \ 

wedges. Therefore the total number of wedges is fully de- 
termined by the degree distribution P(k) and is given by 



2 3 2 3 3 2 2 1 

- V ' V" 

23223321 

* V"V" 

FIG. 5: Two non-isomorphic graphs Gi and G2 with 
N = 10, L — 11 and W = 15, where the nodes are 
labeled with the degrees. The graphs have the same 
edge distribution P(k,k') = (P(l,2) = i,P(2,2) = 
^-,P(2,3) = ^,P(3,3) = i) but different wedge dis- 
tributions P Gl (k,k',k") = (...,P(2,2, 3) = ji,,...) and 
Pa 2 (k,k',k") = (...,P(2,2,3) = ^,...)- In general, an edge 
distribution can not determine the wedge distribution of a 
graph, but P(k, k' , k") determines P(k, k') according to equa- 
tion H 

W = V2L 3 Go, where Go = ^( i ^^ :± ) 2 - It can be shown 
0|, that Go equals to the clustering coefficient in the case of 
an uncorrelated network. 

To derive the relation between the edge and the wedge dis- 
tribution we have to consider that k + k' — 2 wedges pass 
through an edge of degree (k,k r ). Further, considering how 
many edges a wedge can contribute to and using correct nor- 
malization, the following relation for k, k' > 1 is obtained: 

Y,P(k,k',k") + P(k",k,k') 
P(k, k') = V2LC~ ^ k - k , _ 2 (8) 

In a different context and differently defined, the distri- 
butions of more complex objects have also been studied in 
0> fToL [3 ■ Another confirmation of the importance of higher 
order distributions comes from the studies of complex inter- 
connection patterns called network motifs. As the authors of 
[IH reported, network motifs observed in the real networks 
clearly distinguish these networks from artificial randomly 
generated instances. 



D. Connection probabilities 

The wedge distribution introduced in the previous section 
can be used to express various connectivity probabilities be- 
tween nodes, edges and wedges. The connection probabilities 
in turn allow to describe the simple graph constraints. 

Assume an arbitrary vertex of degree k and an arbitrary 
edge of degree k',k". If k is different from k' and k" , the 
vertex cannot be part of an edge with degree k',k". In this 
case, we can consider a bipartite subgraph, where one parti- 
tion is represented by N(k) (all vertices of degree k) and the 
other partition is represented by all vertices of the edge set 
L(k',k"). In this subgraph there are N(k)L(k' , k") possible 
edges, but only W(k,k' ,k") of them are present. Therefore, 
the probability that an arbitrary vertex of degree k and an 
arbitrary edge of degree k' , k" are part of the same wedge is 
W(k,k' ,k")/(N(k)L(k' ,k")). Similar considerations can also 
be used for the cases k — k' and k = k", therefore the proba- 
bility Pk,(k\k") that a vertex of degree k is directly connected 
to a vertex of degree k' which is a part of an edge of degree 
k' , k" is given by 
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_ 2W-P(k,k',k"j 

n, ( k>,k») (N(k) _ 5k k> _ Sk k „ )L{k , % n w 

where Sij denotes the Kronecker delta. Analogous reason- 
ing for two distinct vertices leads to the probability that a 
vertex of degree k is directly connected to vertex of degree k' . 
This probability will be denoted as P k k i an d is given by 

2L-P{k,k') 
k ' k (N(k) - 5 k , k ,)N(k'Y (U > 

If k 7^ k' then the edges L(k, k') form a bipartite subgraph 
with the partitions N(k) and N(k'). Therefore, an arbitrary 
vertex of degree k participates in L(k, k')/N(k) edges of de- 
gree k, k! . In general, the number of edges of degree k,k', 
which an arbitrary vertex of degree k participates in, is de- 
noted as 



T(k,k') 



k 



P(k,k') 



(11) 



In an analogous way, the number of wedges of degree 
k,k',k" which an arbitrary edge of degree k, k' participates 
in is 



— 2P(k,k',k") 
V2LC °(2-5 Kkl )P(k,k'Y 



(12) 



This quantity can be also interpreted as the number of 
edges of degree k! , k" that share the vertex of degree k' with 
another edge of degree k, k' . 

The wedge distribution is useful in many other ways to 
express the probabilities of connectivity resp. neighborhood 
relations. For example, the probability that a vertex which is 
a nearest neighbor of an edge of degree k, k! has degree k" is 
given by 

P{k ]k ' k) -j:p(k",k,k'y 

k" 

The wedge distribution can also be useful in obtaining more 
accurate estimations of the local clustering coefficient com- 
pared to the situation when only the edge distribution is used 
(see for further results concerning the wedge distribution) . 



E. Master equation for SG-ESP 



To derive the master equation of the SG-ESP process we 
follow the method introduced at the beginning of this section 
for the S-ESP process. First, we specify a general probabil- 
ity term describing, for the given configuration of nodes (see 
Fig. [2}, the probability that the configuration change happens 
i.e. that the edge will be rewired. A list of all possible cases 
leading to a change in the quantity N(k, t) is the same as for 
S-ESP (see Table [!}. 

Three probabilities have to be considered in order to 
evaluate the overall probability of the configuration change. 
Firstly, as discussed in the previous sections, P(k',k") is 
the probability that vertex Vi in Fig. [2] is of degree k' and 
it has a direct neighbor vertex V, of degree k" . Secondly, 

N(f(k]) ( N ( k "') - S k '",k' - $k'",k") is tne probability that ver- 
tex Vi, which is selected with a probability proportional to 
f(k), is of degree k'" and is not a part of the edge Ei. The 
last probability to consider is the probability that there is no 
edge between Vi and Vj. It is equal to (1 — Py Ak" ,k')) as 
discussed above. Putting all together, the probability that 
the rewiring occurs is given by 



N{f(k)) 



(1-P k '",(k",k'))- (13) 



The enumeration of all possible contributions to N(k, t + 1) 
is identical to the S-ESP process, so after summarizing all of 
them, we obtain the following master equation for the degree 
distribution. In the equation we suppose that the quanti- 
ties (as N(k), P(k' , k") and Pk,(k',k")) related to some object 
distribution are functions of discrete time t, however in the 
following text we omit the time variable t to simplify the no- 
tation in cases where its presence is clear from the context. 



N(k,t + 1) = N{k) - 2_j P ( fc > fc ) ]v(7) ^ ~ p k,(k',k")) 

+ P(k , k ) jj-jj^ (1 - Pk-i,(k',k")) 

Utl\( \T(lJf'\ 



ST pa- ^ f(k'")(N(k>")-Sk„,,k,-5k„,,k) M p , 
2^ p (k,k) jjy^ (1-P k '",( k ', k) ) 



ufu' u , ^f( k '")(N(k'") - S k ,,, k , -8 k w k+1 ) 
+ 2_,P(k,k + l) ! (1 - P fc "',(fc',k+l))- (14) 



I 

To see the relation of this equation to the equation for S- can be written as 
ESP (j4| all terms in the equation above can be expanded in 
a particular way. For example, after expanding all factors of 
the last term on the right-hand side of equation (|14[) , the term 



Graph 


P(k,k') 


P(k) 


E fc / k'P[k',2) 


2 2 11 
A # • • 

• • 


1/2 \ 
{ 1/2) 


(k/2 fc/4) 


1 


1 2 1 

• — • — • 


( l/2\ 


(k/2 fc/4) 


1/2 




[l/2 o) 



TABLE II: This example shows that it is not possible to ex- 
press J2f( k ') p ( k ', k ) b y F (.f( k ), p ( k )), where F denotes an 

k' 

arbitrary function. Consider the two edge distributions in the 
second column of the table, which describe edges with degree 
k, k' — 1,2. Further assume that f(k) = k. Both edge distri- 
bution lead to the same degree distribution (the third column) 
according to equation[6] but the expression Y2k' f(k')P(k' , 2) 
is different. 



*+ijv(fc + i)_ 

Nk K ' 
^ f(k')P(k',k + l) /(fc + l)(fc + l) 

2W ■ f{k"')P(k'",k',k + l) 
N(f)L(2-5 k+1 , k ,) 



(15) 



The other three terms of equation (|14|l can be written in a 
similar way. This form of equation (|14|l shows that all terms of 
the S-ESP process are contained in the master equation of the 
SG-ESP process (the first term in expression (|15[1 corresponds 
to the last term in equation Q etc). The additional terms 
in (|14l) arise from the simple graph constraint and they show 
how well and under which conditions the S-ESP process ap- 
proximates the SG-ESP process. For example, the additional 
terms become negligible in the limit of iV > 1, j^j- <C 1. This 
is an example of conditions under which simple networks can 
be treated as non-simple networks, but the equation (|14[) also 
allows more complex approximations to be investigated. 

The counter-examples in Figures [4] and [5] show, that in 
general it is not possible to obtain P(k,k' ,k") from P(k,k') 
or P(k, k') from P(k). One can also show by counter-example 
(see table HH that ^ f(k')P(k', k) cannot be expressed by 

k' 

F(f(k),P(k)), where F denotes an arbitrary function. This 
means that equation [T3] can not be written in a self-consistent 
way and hence is not solvable in principle. To obtain a closed 
description we need to consider adding equations for higher 
order distributions into the system. 



To see what are the building blocks of SG-ESP master equa- 
tions for higher order distributions we consider first the edges. 
In order to derive a master equation for the edge distribu- 
tion, all situations at time t which will contribute to L(k, k') 
at time t + 1 must be enumerated. For example, if Vi is of 
degree k + 1 we must know how many edges of degree k + 1, k! 
participate in Vi, because all these edges will lose a connection 
and contribute to L(k,k'), too. However, vertex Vi was not 




FIG. 6: (color online) The randomly selected edge Ei touches 
the edges (red colored) that share a common end- vertex with 
Ei. Assuming that Vi is of degree k and Vj is of degree k' , the 
wedge distribution P(k' , k, k") is needed in order to know how 
many edges of degree k, k" share Vi as a common end- vertex 
with Ei, see expression [T2l 



selected independently (like vertex VJ), but it is an end- vertex 
of the edge Ei (see Fig. |5J). Hence, the number of edges of 
a certain degree which participate in vertex Vi is given by 
expression \T%\ This expression is proportional to the wedge 
distribution P(k,k' ,k"). Therefore, the wedge distribution 
is needed to derive the master equation for the edge distri- 
bution. Moreover, when trying to derive master equations 
for wedges, the problem persists: In order to know how many 
wedges of a certain degree share vertex Vi , which is again part 
of an edge with prescribed degrees, we need the knowledge of 
a distribution which is of a higher order than that for wedges. 

The analysis above shows that to fully understand the be- 
havior of the SG-ESP process, a whole hierarchy of objects 
and their distributions must be involved. How far this hier- 
archy above the wedge distribution is reaching would be very 
difficult to know, however one conclusion is that the main 
source of the problems is the edge selection Step [T] of the SG- 
ESP process and the simple graph constraint. A possibility 
how to understand more about this complex phenomena and 
still have a closed description of the process is to introduce 
more complex vertex selection rules which would approximate 
the edge selection rule. 



III. VERTEX BASED ADDITION AND 
DELETION OF EDGES 

In this section we describe a class of processes that pre- 
serves the simple graph structure and that can be modeled 
by a single self-consistent equation. This class will be called 
"VADE", which stands for "Vertex based Addition and Dele- 
tion of Edges" and it aims at approximating the edge selection 
rule of SG-ESP with vertex selection rules. The VADE pro- 
cess is defined as 

Process 3 (VADE) 

The following steps are repeated on a graph G in each dis- 
crete time unit t. 

1. With probability q do the following: 

(a) choose a vertex Vi with a probability proportional 
to /i(fe); 

(b) choose a vertex Vj with a probability proportional 
to f 2 (k); 
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Degree of Vi 



fc-1 
fc-1 
fc'-l 
fc- 1 
fc'-l 

k 
k 
k' 
k 
k' 

fc-1 
k- 1 
fc'-l 
fc'-l 



Degree of Vj 



fc-1 

/ {fc,fc',fc- l,fc' - 1} 

fc'-l 

fc'-l 
/ {fc,fc',fc- l,fc' - 1} 

fc 

/ {fc,fc',fc- l,fc' - 1} 
fc' 
fc' 

/ {fc,fc',fc- l,fc' - 1} 

fc 
fc' 
fc 
fc' 



Change in L(fc,fc') 



+2T(fc - l,fc') 

+T(fc- l,fc') 
+2T(fc' - l,fc) 
+T(fc - 1, fc') + T(k' - 1, fc) + 1 

+T(fc'- l,fc) 

-2T(fc,fc') 

-T(fc,fc') 
-2T(fc',fc) 
-T(fc,fc') -T(fc',fc) 

-T(fc',fc) 

+T(fc- l,fc') -T(fc,fc') 
+T(fc - 1, fc') - T(fc', fc) 
+T(fc'-l,fc)-T(fc,fc') 
+T(fc'-l,fc')-r(fc',fc) 



TABLE III: Similarly as in Table U a list of the cases at time t that will lead to a change in the quantity L(k, fc') at time t + 1 
is provided. The table gives only the cases where an edge is added and where fc 7^ fc'. The cases where the edge is deleted are 
analogical. Cases that follow from permutation are also not listed, for example, the case where Vi has degree fc' — 1 and Vj has 
degree fc — 1 is not listed, because this case follows directly from the case at line 4 by permutation. When considering all cases 
for the edge addition (also the permuted ones) a total of 24 is found, analogically we obtain 24 cases for the edge deletion. 



(c) ifVi — Vj or ifVi is directly connected to Vj, skip 
the next step; 

(d) add an edge between Vi and Vj. 

2. With probability 1 — q do the following: 

(a) choose a vertex Vi with a probability proportional 
to gi (fc); 

(b) Choose a vertex V m with a probability proportional 
to g 2 (fc); 

(c) if Vi = V m or if Vi is not directly connected to 
V m , skip the next step; 

(d) delete the edge between Vi and V m - 

The VADE process is similar to the SG-ESP process in 
several ways. It contains preferential selection parameters 
(/ij /21 flij 92), which give the process the flexibility to ap- 
proximate the edge selection rule of SG-ESP. If the parame- 
ter q is chosen appropriately as we discuss at the end of the 
simulation section, the number of edges is stable in a narrow 
interval approximating the constant number of edges in the 
SG-ESP process. The process also preserves the simple graph 
structure, and the preferential selection parameters can be set 
to generate a degree distribution which is similar to the one 
generated by the SG-ESP process (see the simulation study 
in Section ITV) . 

The reason why the VADE process can be modeled by a 
self-consistent master equation is that all the selection rules in 
the VADE process are defined for vertices; in contrast to the 
SG-ESP process where an edge selection rule is used. Namely, 
the probability that two vertices are directly connected with 
each other requires the knowledge of P(fc,fc') (see equation 
I10[) . and also only from P(k, fc') we can derive how many 



edges of degree fc, fc' participate in a vertex of degree fc, see 
expression Qllj l . This is the reason why the dynamics of the 
VADE process is fully captured in a closed master equation 
for the edge distribution. 

To derive a master equation for this process the two process 
branches (edge deletion and edge addition) can be analyzed 
independently and their contributions can be added together. 
Similarly as for the SG-ESP process, the probability that ver- 
tex Vi is of degree k iy vertex Vj is of degree kj and that the 
conditions mentioned in the step [lc] of VADE are not met is 
given by 



A(ki, kj) 



f 1 (k l )f 2 (k,)N(k i )( y N(k ] ) - 5 hitkj ) (l - P kitkj ) 



(16) 

The probability that vertex Vi is of degree fc; , vertex V m is 
of degree fc m and that the conditions mentioned in the step 
l2"cl of VADE are not met is given by 



D(ki,k m ) = 



gi(ki)g 2 {kj)N(km)(N(km) - 5 kl , km y 



N 2 {gi){g2) 



(17) 

After the analysis of all situations at time t which will con- 
tribute to L(fc,fc') at time t + 1 (there is a total of 48 such 
situations, see Table ITTT|) . and after considering that A(k, fc') 
as well as D(fc,fc') are both symmetric in fc,fc', the master 
equation for the edge distribution can be written as (again, 
for notational simplicity the time variable is suppressed on 
the right hand side) 
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L t+1 {k,k') = L(k,k') + 2qJ2 



-A(k, X)T{k, k') - A(X, k')T(k'k) 



A{k - 1, X) (T(k -l,k') + |«*'-i.jr) + A(X, k' - 1) [T(k' - 1, fc) + ^k-i,x) 
2(1 - q) £ 

-D(/c, X) (V(fc, fc') - i<5fc',x) - fc') (V(fc', fc) - ife./b) 



D(fc + 1, X)T(k + 1, fc') + D(X, k' + l)T(k' + 1, k) 



(18) 
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FIG. 7: (color online) Simulation experiments for a network 
with N=1000, L=2000. The x-axis represents degree values 
and the y-axis the values of P(k), both axes are in logarithmic 
scale. The processes were simulated over 2.10 6 iterations to 
the equilibrium state. The figure shows the simulation results 
for the S-ESP and SG-ESP processes with f{k) = k. To 
obtain the resulting graph a time averaging over 15 instances 
separated with 10 4 states was computed. The S-ESP process 
condensates and all edges are rewired to a few nodes (finally 
to only one node) as self-connections. 



FIG. 8: (color online) Simulation experiments illustrating the 
similarity between the distributions P(k) of VADE and SG- 
ESP processes. The meaning of the axes is the same as in 
Figure [7] and the same averaging procedure was used. The 
parameters for the SG-ESP process are: N=1000, L=2000 and 
f(k) = k except for /(0) = 5. The network stabilizes in an 
equilibrium state with a distribution near to the distribution 
of a scale-free network. The VADE process was simulated 
with the parameters: N = 1000, q = 0.004, = k, /i(0) = 

1000, f 2 (k) =k, gi = l,p2 = 1. 



IV. SIMULATIONS 



This equation is valid for fc 7^ fc'. If k = k', the summation 
term on the right hand side must be multiplied with the fac- 
tor 1/2. The four terms proportional to q account for the case 
when an edge is added, the four terms proportional to 1 — q ac- 
count for the case when an edge is deleted. The terms T(k, k') 
account for the edges that change their degree because they 
participate in one of the involved vertices, whereas the terms 
S k k i account directly for the edge that is added or deleted. 

The aim of introducing the VADE process is mainly to show 
that certain processes can lie in the complexity hierarchy be- 
tween processes described on the level of the degree distribu- 
tion and processes having very high description complexity as 
SG-ESP. VADE also illustrates, which selection rules would 
bring the most problems thus providing a direction where the 
next modeling efforts could concentrate. The description of 
VADE is closed on the edge level and therefore definitively 
solvable. However, to understand more deeply the relation 
between SG-ESP and VADE more research would be needed 
to solve its complex and nonlinear master equation. 



To numerically support the analysis we described in the 
previous sections, we simulated the S-ESP, SG-ESP and 
VADE processes. Our experiments focus on small networks 
and a very long simulation time where a difference between 
the S-ESP and SG-ESP processes is clearly visible. In all 
cases we first generated a classical random network with Pois- 
son distribution and average degree 4.0, which was then used 
as initial condition for all experiments. 

The experimental results in Figure[7]show that in situations 
where the effects of simple graph constraints and of finite 
network size cannot be neglected, S-ESP and SG-ESP behave 
in a very different way. S-ESP condensates, on the contrary 
SG-ESP is developing a highly interconnected kernel. The 
experimental results for the range of parameters in Figure [7] 
also point to the analysis in [f|, especially in relation to the 
analysis of the dense network kernel. 

Another conclusion from the simulation experiments (see 
Fig. [8]l is that the VADE process can approximate the SG- 
ESP process very closely due to the flexibility caused by the 
preferential parameters /i , /2 , pi , <?2 ■ This fact opens a pos- 
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FIG. 9: (color online) The figure shows the simulation results 
for the VADE process with the same parameters as in Fig- 
ure [8] The x-axis represents the discrete time and the y-axis 
the number of edges, both axes are in linear scale. The sim- 
ulation experiments show that the number of edges becomes 
quickly stable. 



sibility of having new processes which have a closed master 
equation but which can approximate the SG-ESP process. 
Figure [8] shows that the VADE process generates a scale free 
network with very small difference to SG-ESP. 

In Figure [9] the number of edges for the VADE process 
is plotted. The experiments show that if the parameter q of 
VADE is correctly set the number of edges will stay stable and 
it will approximate the SG-ESP process. The parameter q can 
be estimated using the following approximation: the number 
of edges will stay constant on average if the probability of 
adding an edge between two vertices with degree k, k' is equal 
to the probability of deleting an edge between two vertices 
with degree k, k! . So one has 



qA(k,k') = (1 - q)D(k,k') 
1 



1 



A(k,k')/D(k,k>) 
1 



/lW/2(fc')(l-f fc , fc /)<gl><g2> 

gi(k)g 2 (k')P k h , </i)</a) 



(19) 



Assuming that fi(k) = gi(k) and f2(k') = g2(k') this equa- 
tion will simplify to 



k,k' 



2LP(k,k') _ 2L 
N 2 P(k)P(k') ~ W 



For the last approximation step an uncorrelated network is 
assumed and both k and k' are replaced with k. For the 
network parameters in Fig. [8] a value of q — 0.004 is obtained. 
This value was used during the experiments and the results 



are approximating the objective value L = 2000 very closely 
(see Figure |9}. 



V. CONCLUSION 

To summarize, the analysis in Section [TT] has shown a 
method how to describe the process constraining steps that 
keep a network evolution process in the class of simple graphs. 
To model these constraints we introduced a new distribu- 
tion that described wedges - the paths of length 2. The re- 
lation between edge distribution (degree-degree correlation) 
and wedges was also studied to understand the evolution of 
simple graphs. A combination of such constraints with simple 
edge selection rules can lead to a very high complexity involv- 
ing several higher order distributions. How far in the distri- 
bution hierarchy one has to go to obtain a closed description 
of the SG-ESP process is not clear and it would need more re- 
search to understand the situation fully. The study of simple 
graph constraints provides further reasons why it is worth- 
while studying (and measuring on real networks) the higher 
order distributions as for example the wedge distribution. 

Up to now, the processes were defined in such a way, that 
it was possible to formulate a closed master equation for the 
degree distribution. However these processes either did not 
respect the simple graph structure or were limited in some 
other way. When assuming that self-organization is the driv- 
ing force for the emergence of complex network structures 
without self-loops and multiple connections, higher order dis- 
tributions are key, in particular for non-growing simple net- 
works. 

The present paper opens several directions for possible re- 
search. Further approximations of equation [14] (or process 
modes and constraints) can provide deeper insight into the 
processes on simple graphs or into the parameter ranges where 
they can be better understood. Another possibility is to study 
in more detail the intermediate VADE class of processes which 
are more complex than classical random graphs but simpler 
than processes containing edge selection rules. These pro- 
cesses can bring more light into the immense complexity of 
simple graphs through approximation of edge selection rules 
with more complex set of vertex selection rules. 
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